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Abstract 

Reproducibility is increasingly important to statistical research m, 
but many details are often omitted from the published version of com¬ 
plex statistical analyses. A reader’s comprehension is limited to what the 
author concludes, without exposure to the computational process. Often, 
the industrious reader cannot expand upon or validate the author’s results. 

Even the author may struggle to reproduce their own results upon revis¬ 
iting them. R Markdown is an authoring syntax that combines the ease of 
Markdown with the statistical programming language R. An R Markdown 
document or presentation interweaves computation, output and written 
analysis to the effect of transparency, clarity and an inherent invitation 
to reproduce (especially as sharing data is now as easy as the click of a 
button). It is an open-source tool that can be used either on its own or 
through the RStudio integrated development environment (IDE) [I5 r In 
addition to facilitating reproducible research, R Markdown is a boon to 
collaboratively-minded data analysts, whose workflow can be streamlined 
by sharing only one master document that contains both code and con¬ 
tent. Statistics educators may also find that R Markdown is helpful as 
a homework template, for both ease-of-use and in discouraging students 
from copy-and-pasting results from classmates. Training students in R 
Markdown will introduce to the workforce a new class of data analysts 
with an ingrained, foundational inclination toward reproducible research. 

The scientific method emphasizes reproducibility as a key component to cor¬ 
roborating and extending results. While noble in theory, there are roadblocks 
to realizing reproducibility in statistics and data analysis, chief among them 
an outdated reliance on copy-and-pasting between computational environments 
and text editors. Dividing labor in this way: 1) introduces trivial errors; 2) 
allows selective reporting; 3) shields the data analysis from public appraisal; 4) 
impedes intuitive workflow; 5) complicates iterative analysis on new data; 6) 
makes collaboration awkward; and 7) is time-consuming. Copying tables and 
output from one window to another creates opportunity for errors, such as out¬ 
put incongruously placed beside the wrong written analysis. It also necessitates 
authorial discretion in deciding which parts of what output are moved into the 
final report, which can lead to misrepresentation of results. In the classroom, 

*MassMutual Data Labs, Amherst, MA 01027, dana.udwin@gmail.com 
fSmith College, Northampton, MA 01063, bbaumer@smith.edu 


1 



the copy-and-paste workflow offers a chance to fudge numbers or cop class¬ 
mates’ figures, and may saddle the instructor with a messy, patchwork report to 
grade. Peer review, critique, or even further work is not straightforward when 
the publication does not include code or computation, or includes it in separate, 
difficult-to-navigate files. 

R Markdown [1J is an open-source authoring format that interweaves written 
analysis and statistical computation to produce documents, presentations, and 
other types of reports. It can be used directly or through the RStudio IDE, and 
relies on plain markdown syntax—built on the ethos that a source file should 
be readable before rendering |8I—and the statistical programming language R. 
A wide array of user-contributed packages, freely available documentation, and 
a plethora of user-moderated blogs, tutorials, and message boards make R re¬ 
sponsive to users’ needs. R Markdown is equivalently responsive to users’ needs. 
The code in an R Markdown document is reevaluated every time the document 
is rendered, enabling the report to reflect changes in data. The final output 
contains code and written analysis where the author(s) wrote it into the source 
file, as well as output following those generating commands. Such transparency 
enhances readers’ comprehension and invites review. Even on the authorial side, 
integrating computation and textual interpretation creates a natural workflow 
and smoothes difficulties in collaboration. If there are multiple authors, they 
can use online file-sharing services to modify a single R Markdown document 
rather than pass back-and-forth dissociated written and computational compo¬ 
nents that do not respond to changes in the other. The learning curve with 
R Markdown is relatively shallow due to its simple and well-documented syn¬ 
tax. In short, blending written analysis and statistical computation through R 
Markdown is an elegant means to reproducibility. 

1 Overview 

Creating scientific documents is complicated by the necessity of including mul¬ 
tiple kinds of information: text, figures, code, and mathematical symbols. 
DTf^X (and its predecessor Tf^X) has become the state-of-the-art for scien¬ 
tific papers—due in part to its beautiful and careful rendering of mathematical 
elements—while Microsoft Word is more commonly used in disciplines that re¬ 
quire less mathematical notation. However, neither DTf^X nor Word provides 
the ability to actually compute with data within a document. R Markdown 
provides this functionality in a straightforward and easy-to-use manner. 

Like DT^X or HTML, but unlike Word, R Markdown employs a source file 
and output file paradigm. That is, commands and sentences are typed by the 
user directly into a source file—which is just a plain text file written in a certain 
format—and this source file is then rendered into an output file. The source file 
typically may only be read by the author, who will then distribute the output 
file for public consumption. 

Yet R Markdown offers four main advantages over DTf^X, Word, or HTML 
for statistical analysis: 
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• Simplicity: Markdown syntax is far simpler than DTf^X or HTML. While 
the text-formatting capabilities are not as feature-rich, they are sufficient 
for most purposes. 

• Readability: Markdown syntax is designed so that even the source file is 
human-readable |7 . Conversely, LTpX and HTML source documents can 
be very difficult to parse visually. 

• Transparency: All formatting is encoded clearly in a Markdown source 
file. Conversely, formatting in Word can involve navigating a complex and 
occasionally inscrutable system of drop-down menus and option windows. 

• Embedded Computation: An R Markdown document contains R code in 
its source file, and then the processed results of those commands (along 
with the commands themselves) in the rendered output file. 

Moreover, R Markdown can produce an output document in either PDF, 
Word, or HTML format from a single source file. 

Thus, integrating source code, statistical output, and text in R Markdown is 
a model of reproducibility. Such transparency facilitates comprehension, defen- 
sibility, and further research or testing. R Markdown helps to bring the vision for 
reproducibility in statistical analysis articulated in [6] to reality. This vision— 
in which the barriers to verify another’s statistical computations from start to 
finish are low—is the intellectual descendant of [3] and [4], and began with 
Knuth mi, the creator of T^X. Moreover, R Markdown is dynamic. Each time 
the document is rendered, the commands therein are run anew: if data are al¬ 
tered or different data are called in advance of rendering, then the output is 
dependent and calculated accordingly m ■ 

Before returning to more technical details, we give a brief example of how 
R Markdown can be used by statistics students for homework assignments (or 
analysts for reports, et cetera). 

1.1 An Example Homework Assignment 

In Figure [lj we present an example of how a homework assignment for an 
introductory statistics student can be written in R Markdown. On the left is the 
R Markdown source file into which the student would type. Lines 2-5, sandwiched 
above and below by three hyphens, contain header information (the syntax is 
YAML [5]). All but the output: ht ml .document is printed in the rendered 
output shown on the right; this output designation means that the rendered 
output is an HTML document. Alternative output formats are PDF and Word. 

The R Markdown source file contains both written text and “chunks” of R 
code, demarcated by sets of three backticks. The RStudio editor shown at left 
automatically highlights these chunks in light grey. 

Options specific to each chunk may be included in curly backets. For exam¬ 
ple, message=FALSE omits package-loading messages from the rendered output. 
One can see in the rendered output that R Markdown prints statistical output 
immediately below the chunk of R code with which it is associated. 
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File Edit Code View Plots Session Build Debug Tools Help 

- s'* * Project (None) » 


©j Homework.Rmd x 



Knit HTML ' 

^^Run Q Chunks' 

1' 

• | - 


2 

title: "Homework” 


3 

author: "suzie smith" 


4 

5 

date: "Tuesday, October 28, 2014" 
output: html_document 


6' 

. - 


7 



8' 

'''{r, message=FALSE} 


9 

libraryOnosai c) 


10' 



11 



12 

1. what variables are in the KidsFeet data set? 


13 



14 ’ 

'{r} 


15 

names(KidsFeet) 


16' 



17 



18 

Note that the 'echo = false' parameter was added 

to the code chunk 2 


. use a visual to compare foot length by sex. 


19 



20' 

'"{r, fig.height=2, fig.width=2} 


21 

bwplot(length ~ sex, KidsFeet) 


22' 



23 



24 



1:1 

Q (Top Level) i 

R Markdown i 


Console R Markdown 

C:/Users/Dana/OneDrive/Documents/ O 


Homework.html I £Tj Open in Browser Publish 

Homework 

Suzie Smith 

Tuesday, October 28, 2014 

library (mosaic) 

1. What variables are in the KidsFeet data set? 

names(KidsFeet! 

## [1] "name" "birthmonth" "birthyear” "length" "width” 

## [6] "sex” "biggerfoot" "domhand" 

Note that the echo = false parameter was added to the code chunk 2. Use a visual to compare foot 
length by sex. 

bwplot(length ~ sex, KidsFeet) 


27 - 



22 - 


B G 


Figure 1: Example of a homework assignment, input (left) and output (right) 


The ease-of-use and transparency inherent to R Markdown makes it a suit¬ 
able tool in the introductory statistics classroom. The alignment of code and 
output in the final rendered document drives home the connection between the 
R commands and the output they create, while eliminating the risk of making a 
mistake when copy-and-pasting output into a text editor. Displaying code in the 
submitted assignment forces students to not simply present a table, model or 
other output, but also to be able to trace the computational origin of their out¬ 
put. This may discourage students from cheating. Students leave the semester 
with a thorough understanding of the reproducible document and its benefits. 
On the educators’ side, grading and troubleshooting a single R Markdown doc¬ 
ument is less cumbersome than juggling separate written and computational 
components, such as a Word document and an R script. 

2 Syntax 

R Markdown is one of many technologies aiming to provide simple but powerful 
authoring environments—as opposed to complex, monolithic word-processing 
applications. Specifically, markdown refers to an increasingly popular plain¬ 
text authoring syntax designed for simple text documents. For example, when 
creating installation instructions for a software application, a developer may 
wish to include simple, functional formatting elements like lists and links, but 
will likely be reluctant to devote time to curating an elaborate visual appearance 
for said document (e.g. multiple fonts). R Markdown is an implementation of 
markdown and includes additional functionality to process output from R. 
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2.1 Markdown 


In this section we will illustrate some of the simple text formatting features of 
markdown. Note that these have nothing to do with R. 

markdown offers enhanced ease-of-use over other authoring languages like 
UTfrJX or HTML. For example, in order to display the word data in italics, we 
use 

\textit{data} 

in UTfeX, 

<i>data<\i> 

in HTML, and an uncluttered 
*data* 

in markdown. Additional markdown syntax for customizing text is intuitive 
and straightforward. 

A large heading can be made by underlining a line of text with at least three 
equal signs. The same construction with hyphens will create a smaller heading. 

Formatting commands are equally simple. For example, asterisks, hyphens 
and plus signs all produce a bulleted list, while either ordinal numbers or pound 
signs (“#.”) create a numbered list. A “greater than” symbol creates a block 
quote, while typing three backticks above and below text sets the text apart in 
a fixed-width box (see Figure [2|. 

Creating hyperlinks is as easy as including a URL in parentheses—the text 
that links to the URL immediately precedes the URL in square brackets. Bold¬ 
face and italic text can be created by enclosing that text with two or one asterisks 
or underscores, respectively. Carets provide superscripts and tildes enable sub¬ 
scripts or strike-through text. Images stored locally or remotely (via a URL) can 
be embedded in the final output as well (see Figure [2|. RStudio automatically 
color-codes text in the source file to distinguish between differently formatted 
text and computation (as shown in Figure [l]). 

Tabular information can be typed within appropriately placed strings of 
hyphens that print as a table in the final rendered output (see Figure [3]). Tables 
can be made more complex if necessary. 

Although markdown does not provide support for all UTf^X commands, it can 
render UTf^X equations wrapped in dollar signs using MathJax. More generally, 
for advanced users who already know HTML, markdown will pass chunks of 
HTML code through to its output. 

2.2 R + Markdown 

As noted above, R Markdown provides a particularly valuable extension to mark¬ 
down for statistical analysis, because it enables R code to be embedded in the 
source file and rendered as output. There are two ways to include R code in an 
R Markdown document: 
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14 [Hyper 1i nks] (http://wires.wi1ey.com/wi1eyCDA/wiresJournal/wisld-wics 
.html) can be useful. 

15 

16 _This_ and this** are bold-faced. 

17 - *This* and _this_ are italicized. 

18 + There are many ways to make a list. 

19 1. Numbered lists are possible. 

20 

21 #. And there are many ways 

22 #. to make a numbered list. 

23 

24 > A block quote 

25 > looks neat. 

26 
27- 

28 A block quote can also appear in a "chunk" that isn’t evaluated. 

29* 

30 

31 include an equation like $1+1$ inline or on its own: $$1+1$$ 

32 

33 you may need to use superscript^*, subscript-2- or a —strike—. 

34 

35 ! [You may want to include an image. ](http://upload.wikimedia.org 
/wikipedia/commons/a/af/Farm-Fresh_pi_math.png) 

36 


2:18 O (Top Level) * 

R Markdown J 


nn^nlp R Marlcrlnwn 

1 1 


Hyperlnks can be useful. 

• This and this are bold-faced. 

• 777£and this are itaidzed. 

• There are many ways to make a 1st. 

1. Numbered ists are possible. 

2. And there are many ways 

3. to make a numbered 1st. 

A block quote looks neat. 


A block quote can also appear in a "chunk" that isn’t evaluated. 

Include an equation ike \(1+1\) inine or on its own: \[1+1\] 

You may need to use superscript 2 , subscript-2- or a -strike-. 

% 


Figure 2: Example of R Markdown syntax and capabilities, input (left) and 
output (right) 


56 - - 

57 Column 1 Column 2 

58 -------— 

59 1 A 

60 

61 2 B 

62 

63 3 C 

64 T - 


Column 1 

1 

2 

3 


Column 2 

A 

B 

C 


Figure 3: A table created with inline notation, input (left) and output (right) 


• Chunks: a block of R code offset from the main text 

• Inline: a single line of R code appearing within the main text 

A chunk—which is executed and printed with the associated computational 
output when the R Markdown document is rendered—is created by including 
three backticks before and after a block of R code. This is the most common way 
to incorporate R commands. Figure [4] shows two separate chunks. The command 
in the first chunk is printed in the rendered document. The second chunk invokes 
the echo=FALSE option so as not to print the plot (cars) command, although 
the plot itself still prints. 

Alternatively, code can be included in an R Markdown document inline with 
text, sandwiched between single backticks. Inline R code is evaluated, but 
not highlighted like “chunks.” The first line of text in Figure [4] includes c r 
nrow(cars) f , which evaluates to “50” in the final document. 

Each R Markdown document is rendered in a separate, new workspace. Thus, 
all R packages as well as data and other objects required for a particular com¬ 
mand to be run in the R Markdown document must be loaded previously in 
a chunk. Including mess age=FALSE in the chunk header suppresses messages 
generated during evaluations, a useful option when composing an R Markdown 
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Pi Homework.Rmd x 


_4 Run 14 Q Chunks' 


? - s® - Knit HTML - 


39- '{r} 

40 knitr::kable(head(cars)) 


43 There are r nrow(cars)' observations n the cars data set. 

44 

45 A visual is helpful (and we don’t want to print the underlying R code). 

46 

47- {r helpful_visual, fig.width=2, fig. height=3|, echo=FALSE} 

48 plot(cars) 


31 Chunk 2: helpful.visual i 


Console R Markdown 

PI .../Documents/Homework.Rmd 

atlijax —variable^ "mathjax-uri:https:7/cdn. mathjax. org/mathjax/latest/MathJax. is 
?config=Tex-AMS-MML_H7MLorMML" --no-highlight --variable highlightjs=Homework_fi 
les/highlight 

output created: Homework.html 


knitr::kable(head(cars)) 



There are 50 observations n the cars data set. 

A visual s helpful (and we don't want to print the underlying R code). 



Figure 4: Different uses of an R Markdown “chunk” 
(right) 


input (left) and output 


report that requires packages but does not need verbose output (see Figure [TJ. 
Analogous options exist for warnings and errors. Informing students of this 
functionality can improve the readability of the rendered output, since many 
packages produce long, uninformative messages when they are loaded. 

There are various other chunk options. include=FALSE will suppress print¬ 
ing both code and output in the final rendered document, but will still evaluate 
the chunk when the document is rendered, results^hide ; includes the code 
but hides the output. The echo=FALSE option suppresses code but includes out¬ 
put. These options can be useful when, for example, you might want to generate 
a plot on an exam, but not reveal to students the commands necessary to draw 
that plot. Or, one can write an exam with embedded solutions, but suppress 
those solutions with echo=FALSE to generate an exam copy. It is wise to name 
your chunks (as modeled by the final chunk in Figure 4, called helpf ul_visual), 
so that the error report generated in the case of failed rendering can pinpoint 
the problematic chunk by name. Chunk options for plots like fig.width and 
fig.height set the size for plots created in the chunk, as shown in Figure [l] 
and Figure [4j 

Often, an author will desire uniform chunk options throughout their report. 
In this case, there are R commands to specify global chunk options, which 
one might include in a chunk that include=FALSE hides in the final rendered 
document. It is also possible to defer output to the end of a document, as a 
means of automatically creating a technical appendix. 
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2.3 Nuts and Bolts 


R Markdown —which was developed by the RStudio team—works in conjunction 
with the knitr package [20!—which was created by Yihui Xie as part of his 
graduate work at Iowa State University [19 . knitr is the successor of Sweave [12'. 
Upon his graduation, Xie was hired by RStudio, and is now the maintainer of 
the R Markdown package. Thus, while R, markdown, R Markdown, and knitr 
were developed separately by independent groups of people, RStudio now quite 
deliberately maintains the R Markdown portion of this universe. In this case, 
the result is a seamless integration of R Markdown functionality—using knitr —in 
RStudio. 

As noted above, markdown is a general-purpose text authoring format for 
documents, and R Markdown is simply markdown, with functionality for com¬ 
puting with R injected into it. knitr is the rendering engine that converts an R 
Markdown document into HTML. In fact, knitr is capable of much more, includ¬ 
ing rendering Sweave documents that combine UTgX and R code. The extensive 
leveraging of existing technologies imbues R Markdown with powerful function¬ 
ality. The current verson of R Markdown uses Pandoc m — an all-purpose text 
file conversion program—to dramatically increase its versatility and long-term 
viability. For example, it is now possible to make an entire modern-looking 
website (such as the R Markdown website itself) using R Markdown and writing 
just a few lines of HTML code. 

3 Workflows 

In this section, we discuss some common situations in which workflows may be 
significantly streamlined through the adoption of R Markdown. 

3.1 Student Homework 

Instructors at several institutions have found R Markdown to be a useful tool for 
student homework in both introductory and higher-level statistics courses [2]. 
Having a student’s work—both computational and analytical—in one document 
provides benefits to both the student and their instructor. The transparency 
created by integrating code and written analysis simplifies troubleshooting and 
dissuades cheating. Underlying R Markdown is an emphasis on reproducibility 
that students may carry forward in future coursework and careers. 

R Markdown also subverts the temptation to copy-and-paste statistical out¬ 
put from the computation environment into a text editor like Word. Flitting 
from window to window increases the likelihood of errors, such as misaligning 
statistical output with the wrong exposition. A less benign consequence of the 
copy-and-paste workflow is an increase in the number o f opportunities for stu¬ 
dents to selectively report results. R Markdown mitigates the risk of selective 
reporting by requiring all code to be contained within and printing all output by 
default. In addition, weaving together statistical computation and written anal- 


ysis is intuitive in a way that mirrors statisticians’ process and betters readers’ 
comprehension. 

3.2 Collaborative Research 

As a well-documented open-source programming language, R itself is already 
an emblem of accessibility. Moreover, we have argued that publishing code, 
output, and written analysis in an integrated document fosters collaboration 
and substantive critique. A variety of blogs and forums host active information 
exchanges for debugging and development. Designing a package specific to one’s 
needs (there are 5,696 packages at last count [9]) is a characteristic activity for 
an advanced R user. Many R projects employ the version-control system Git 
through the web-sharing service GitHub to facilitate group development [ T4l . 
Indeed, GitHub uses GitHub-flavored markdown for its text files, so R Markdown 
users will feel right at home in that system. More generally, R Markdown belongs 
to the same ethos of collaborative development. (In fact, the markdown package 
for R is hosted on GitHub!) 

More advanced features are available to R Markdown users who desire publication- 
quality appearance. One of these nearly 6,000 R package, xtable, coerces data 
to DT^Xor HTML tables. Citations—another crucial component of fully trans¬ 
parent research—are easily incorporated by referencing a bibliography file in 
the R Markdown document’s title header (i.e. bibliography: bib_name .bib). 

To cite an entry from the bibliography, type [@item_ID] in the markdown text 
environment, where item_ID is the citation identifier in the bibliography. 

Statistical analysis is increasingly the work of teams, and R Markdown facili¬ 
tates collaboration within these teams by keeping the narrative and computation 
inherent in data analysis together in a single place. Functionality provided by 
packrat [17—yet another contribution from the RStudio team—enables seamless 
package version management. 

3.3 RStudio Integration 

Although R Markdown is developed by RStudio, one can use R Markdown and 
knitr outside of the RStudio IDE. However, users of RStudio will find several 
tightly integrated features for working with R Markdown. 

R Markdown is among the file types listed in the RStudio File —>• New File 
drop-down menu (see Fig. [5|. After electing to build an R Markdown docu¬ 
ment in RStudio, the user may choose an output type (document, presentation 
or Shiny) or rely on a template. Templates are typically found within the 
inst/rmarkdown/templates directory of an R package and are an opportunity 
to create content using a standardized format. Some popular formats (e.g. the 
Journal of Statistical Software template in the rticles package [18]) are pub¬ 
licly available, but users are free to create their own. 

RStudio offers additional flexibilty after the author has built their R Mark¬ 
down source file and is ready to render the document. The “knit” button has a 
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New R Markdown 


- Document 
Pi Presentation 
0 Shiny 
H From Template 


Title: 

Author: 


Homework 


Suzie Smith| 


Default Output Format: 

® HTML 

Recommended formatter authoring [you can switch to PDF 
or Word output anytime). 

Opdf 

PDF output requires TeX [MiKTeX on Windows, MacTeX 
2013+ on OS X, TeX Live2013+ on Linux). 

(._) Word 

Previewing Word documents requires an installation of MS 
Word [or Libre/Open Office on Linux). 


( OK j Cancel 


Figure 5: R Markdown dialog box to create a new document, input (left) and 
output (right) 


drop-down option that offers different output options depending on if the user 
is building a document, a presentation or a Shiny web application. 

4 Output Formats 

If using R Markdown in the RStudio IDE, then the document is rendered by 
clicking a button labeled “Knit HTML”; equivalently, calling the: 

rmarkdown::render( ; doc.rmd ; ) 

function outside RStudio does the job. 

The current version of R Markdown is based on Pandoc as well as knitr, and 
can therefore produce any of HTML, PDF, and Word document types. Beamer, 
ioslides, and reveal. j s presentations are also possible. Interactivity has been 
introduced through integration with Shiny, a web application framework with 
R. 

In what follows, we outline the three major types of output documents. 

4.1 Document 

The most common use for R Markdown is to generate static documents. 

After filling in the empty fields and selecting an output format from HTML, 
PDF, or Word in the dialog box (Fig. [5]), a new document with a pre-filled YAML 
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HomewGrk.htrnl £Ll Open in Browser 


&l Homework.Rmd 

^ Knit HTML - _^Run 

1 - — 

2 title: "Homework" 

3 author: "suzie smith 11 

4 date: "Tuesday, October 23, 2014" 

5 output: html_document 

6 - - 

7 

8 Header #1 
10 

11 Header #2 
!2 t - 


Chunks^ 


Homework 

Suzie Smith 

Tuesday, October28, 2014 

Header #1 

Header #2 


Figure 6: Example of a YAML document header, input (left) and output (right) 


header is created (see Figure [6]). Several additional parameters can be added 
to the header, such as table of contents (“toe:”), themes, or output formats 
which can be rendered simultaneously. For maintaining a consistent look-and- 
feel across multiple documents (e.g. a website), the YAML header may be 
saved as a separate file. Any R Markdown documents in the same directory will 
automatically use this header information. 

In addition, the latest version of R Markdown allows for footnotes and cita¬ 
tions with comparable ease. 

4.2 Presentation 

R Markdown can generate presentations in either HTML (ioslides, reveal.js) 
or PDF (Beamer) format. The former takes advantage of newer features in 
HTML5, and can be viewed in any modern web browser. Conversely, the latter 
requires a local installation of T^X. Similar to documents, a user can seamlessly 
render the same R Markdown presentation in either output format, without 
changing the content of the source file. 

The format for R Markdown presentations is similar to that of documents, 
with individual slides demarcated by double pound signs (##)• As with R 
Markdown documents, the YAML header controls various options, including the 
overall display size (e.g. widescreen), text size, bullet formatting, and transition 
speed. The header will also take a logo option so that an image of the author’s 
choice is projected onto the title page and slide footers. Options like f ig_height 
and f ig_width control the default figure size through the presentations. 

Outside the header, two pound signs signifies the start of a new slide. CSS 
attributes—which can also be defined in an external file—in curly brackets follow 
the slide title. In Figure [7| the “.flexbox” and “.vcenter” attributes enable 
center-aligned text. Calling class="red2" creates colored text. Bulleted lists 
mimic the syntax in an ordinary R Markdown environment, as do chunk options 
and R commands. 
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v>- S * 



Presentation. Rmd x 


? » j® 1 Knit HTML - 

_4 

1 ’ 

• - 


2 

title: "Presentation" 


3 

author: "suzie smith" 


4 

date: "Tuesday, October 28, 2014" 


5 

output: ioslides_presentation 


6 ’ 

•- 


7 



8 - 

## slide #1 {.flexbox .vcenter} 


9 



10 

<div class="red2"> 


11 

you can center text. 


12 

</div> 


13 



14 - 

## slide #2 {.columns-2} 


15 



16 

- column A 


17 



18 

- Column B 


19 



20 - 

## slide #3 


21 



22 - 

''*{r, echo=FALSE} 


23 

plot(cars) 


24 * 



25 



26 



3:21 

Q (Top Level) ; 



Console R Markdown 


Slide #1 


You can center text. 


Figure 7: R Markdown ioslide presentation, input (left) and output for Slide 1 
(right) 


Slide #1 


Slide #2 


Presentation_example 


Slide #3 



Figure 8: Rendered ioslides 
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a 0 


4 Pro 


Shiny.Rmd* x 

I O 0 T Ok 


? - 


^fRun 5.4 Q Chunks- 


► Run Document ® 

1 - — 

2 title: "shiny" 

3 author: "suzie smith" 

4 date: "Tuesday, October 28, 2014" 

5 output: html_document 

6 runtime: shiny 
7» — 

8 

9” '"{r, echo=FALSE> 

10 inputPanel( 

11 selectlnput("n_breaks", label = "Number of bins:", 

12 choices = c(30, 40, 50, 60), selected = 40), 

13 

14 sliderlnput("bw_adjust", label = "Bandwidth adjustment:", 

15 min = 0.05, max = 1, value = 0.05, step = 0.05) 


18 - renderPlot({ 

19 hist(carsSspeed, breaks = as.numeric(input$n_breaks), 

20 xlab = "Distance", main = "cars") 

21 

22 dens - density(cars$speed, adjust = input$bw_adjust) 

23 lines(dens, col = "red") 


R Markdown ; 


Console R Markdown 

G/Users/Dana/OneDrive/Documents/ ^ 


’help.startO ’ for an HTML browser interface to help. 
Type 'qO‘ to quit R. 


Shiny 

Suzie Smith 

Tuesday, October 28, 2014 


Number of bins: 
40| 

30 

□ 

50 


Bandwidth adjustment: 



Figure 9: Shiny code that generates an interactive plot, input (left) and output 
(right) 


4.3 Shiny 

Shiny is a web application framework for R that can be leveraged in R Markdown 
to enable interactivity. As opposed to the document and presentation outputs, 
which are static, a Shiny application is dynamic, and contains elements that the 
reader can manipulate. 

Shiny code can be used to generate dynamic plots, as demonstrated in Figure 
[9] with the renderPlotO function. The histogram and density plots called 
in renderPlotO rely on user input parameters defined in the inputPanelO 
function. 

Alternatively, an entire Shiny application can be embedded in the document, 
by either defining it within a chunk using the shinyAppO function or calling a 
Shiny application defined elsewhere using the shinyAppDir () function. Figure 
[To] shows shiny AppO at work. 

5 Conclusion 

R Markdown is a statistical authoring tool that meets high standards of usabil¬ 
ity, reproducibility, and functionality. For conducting research and assembling 
a report, R Markdown offers an intuitive workflow that allows for interspersed 
computation and written analysis in a single source file. Combining statistical 
programming and text in one platform eases the strain of collaboration when 
there is more than one researcher at work. Because the R code is reevaluated 
every time an R Markdown document is rendered modifications to the data are 
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Suzie Smith 

Tuesday, October 28, 2014 
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Figure 10: Full Shiny application defined within an R Markdown chunk, input 
(left) and output (right) 


automatically reflected in the output of subsequent renderings. Statisticians 
with changing data need no longer spend time recoding and copy-and-pasting 
output. They also have increased flexibility for sharing their work: presenta¬ 
tion or document, static, or with dynamic user-controlled elements, with all the 
text-formatting customization available in markdown. In terms of possible data 
manipulation, visualization and general computational capabilities, their arse¬ 
nal is as expansive as R: that is, virtually limitless, as R is open-source with a 
growing number of packages. When the paper or presentation is complete and 
the final product distributed, readership can survey work that is transparent, 
with potentially every table, figure and computation matched to the R code 
responsible. The discriminating reader can trust that the work is sound (or 
diagnose missteps in approach) and may be empowered to reproduce the re¬ 
search. Publishing computational process alongside results facilitates academic 
dialogue. 

R Markdown is also a natural fit in the statistics classroom (introductory or 
advanced), where using one platform to import, clean, analyze, and interpret 
data provides a streamlined workflow. Submitting one document for grading in 
turn simplifies the instructor’s work, who would otherwise review isolated out¬ 
put without specific information as to what went wrong in calculation. When 
students compile a semester’s work to study for exams, they will review doc¬ 
uments that clearly associate code and output. In short, R Markdown satisfies 
the call for reproducibility in scientific research while also improving workflow. 
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